# Multitask Speech Processing
Meralion AudioLLM Whisper SEA LION
Other
A speech-to-text large language model customized for Singapore's multilingual and multicultural environment, integrating Whisper-large-v2 speech encoder and SEA-LION V3 text decoder
Text-to-Audio
Transformers

M
MERaLiON
2,828
12
Kotoba Whisper Bilingual V1.0
Apache-2.0
Kotoba-Whisper-Bilingual is a distilled model collection trained from the Whisper model, specifically designed for Japanese and English speech recognition and speech-to-text translation tasks.
Speech Recognition
Transformers Supports Multiple Languages

K
kotoba-tech
782
13
Fsmn Vad
Other
FunASR is a foundational toolkit dedicated to bridging academic research and industrial applications in speech recognition, supporting various functions such as speech recognition, voice activity detection, and punctuation restoration.
Speech Recognition
F
funasr
107
17
Featured Recommended AI Models